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CSJ Abstract 

The function h{k) represents the smallest number m such that every sequence of m 

consecutive integers contains an integer coprime to the first k primes. We give a new 

Q-i computational method for calculating strong upper bounds on h(k). 

<D 

2 1 Introduction 

^""j " 1 Letting pi be the ith prime and be the product of the all primes up to p^, Jacobsthal's 

function h(k) represents the smallest number m such that every sequence of m consecutive 
integers contains an integer coprime to The function h(k) has been studied by a number 
of different authors, and is central to results on the maximal gaps between consecutive 
primes [6], [7] and on the least prime in arithmetic progressions [8). Explicit values of h(k) 
are known only for k < 49, with the computation of /i(49) by Hagedorn taking 3 months on 
^ a cluster of 30 computers (2]. 

Let g(n) represent the smallest number m such that every sequence of m consecutive 
integers contains an integer coprime to n. In a letter to Erdos IT], Jacobsthal asked whether 

in h(k) <C k 2 

g(n) < h(k) for all n with k distinct prime factors 

both hold for all positive k and some constant C. Iwaniec's proof E] that 

h(k) <C {k log k) 2 



X 

for an unknown constant C gives our closest approach to the first of these conjectures. Hadju 
and Saradha [3] recently disproved the second conjecture using the explicit values of h{k) 
calculated by Hagedorn. 

The best known explicit upper bounds on h(k), of 

h(k) < 2 k 

h(k) < 2k 2+2e log k 

are due to Kanold [5] and Stevens [9] respectively, with the second bound being stronger for 
k > 260 (these bounds in fact apply to g(n) for all n with k distinct prime factors, and so cover 
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the case where n = P^). These bounds are quite weak: while from Hagedorn's calculations 
we have h(49) = 742, these bounds give h(49) < 10 15 and h(A9) < 10 40 respectively. We 
thus know relatively little about the explicit behavior of h(k) for k greater than 49. 

In this paper we address this gap using a new computational method for computing 
explicit upper bounds on h(k). This method gives bounds much stronger than those given 
by Kanold and Stevens; for example, this method gives a bound on h(A9) which is less than 
3 times the true value of h(49). This method is also much faster than that used by Hagedorn 
to calculate values of h(k), computing a bound on h(49) in seconds. We used this method 
to compute upper bounds on h(k) for k from 50 to 10,000: for all k in that range we find 

h(k) < 0.27749612254 k 2 log k 

a bound hundreds of orders of magnitude stronger than those given by Kanold and Stevens 
in this range. 

Our method is based on an expression for the function (p(b, m, k), which represents the 
smallest number x such that every sequence of m consecutive integers contains at least x 
integers coprime to Taking <p(n) to be Euler's totient function and ip min (m,i) to be the 
minimum value of ip(b, m, i) across all b, we prove 



m 
P~k 



i=l 



Pi 



i=2 
k-1 k 



r 

2^ 



+ E 



i=2 j=i+l 



LPiPj] 



i < 



for all m and k, where r — m mod Pk and where E is a positive correction arising due to 
constraints on the co-occurence of residues of the primes up to pk- This expression has the 
computationally nice property that the recurrent double sum is dominated by terms where 
% is small, and putting computational effort into calculating accurate values of (p min (r,i — 1) 
across all candidate r's for some set of low values of i allows us to efficiently compute strong 
lower bounds on ip(b, r,i — 1) for a large range of values of k. Since it is clear that 



< (p r . 



1) =>• h(k) < m 



this in turn allows us to efficiently compute strong upper bounds on h{k) across a large range 
of k. 

In the first three parts of this paper we prove this expression. In the last part we describe 
algorithms based on this expression that we use to compute explicit upper bounds on h(k). 



2 Preliminaries 

We take u>(a) to represent the number of distinct prime factors of a, Uk(a) to be the number 
of distinct primes that are factors of both a and Pk, and 4 (a) to be the lowest factor of a 
that is also a factor of Pk. 
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For any set of integers S we take F s (d) to be the number of integers in S that are divisible 
by d; for any integer b we take Fb jm (d) to be the number of integers in the sequence of m 
consecutive integers b + 1, . . . b + m that are divisible by d. 

We use the following result concerning divisors of members of arithmetic sequences. 

Theorem 2.1. Let d and n be two coprime squarefree integers. Then for any arithmetic 
sequence 

b + d,b + 2d,..., 

there necessarily exists a corresponding sequence of consecutive integers 

cb + l,cb + 2,... 

such that 

GCD{cb + x, n) = GCD{b + xd, n) for all x 

Proof. Choose integers c and z such that cd — zn = 1. Since c divides = zn + 1 it is clear 
that c and n are coprime and so we have 

GCD(c(b + xd) — xzn, n) = GCD(b + xd, n) for all x 

(because xzn shares all of ra's prime factors while the only prime factors that c(b + xd) 
shares with n are those of (b + xd)). Rearranging the left hand side (and using the fact that 
cd — zn = 1) we get 

c(b + xd) — xzn = cb + x(cd — zn) = cb + x for all x 

and the consecutive sequence starting at cb + 1 has the required property. □ 

Following from this we define c b (d) as follows: 

Definition 2.2. For some integer d with pi as its lowest prime factor, let y + d be the first 
term in the sequence b + 1, . . . b + m that has d as a divisor. We have then an arithmetic 
sequence 

y + d,y + 2d,...,y + F b>m (d) x d 

all of whose members are divisible by d and all of whom are in the sequence b + 1, . . . b + m 
. Then Cb(d) is an integer such that 

GCD(c b (d) + x, Pi_i) = GCD(y + xd, P;_i) for all x 

and so the sequence of consecutive integers starting at Cb(d) + 1 all have the same prime 
factors in common with P{-\ as the corresponding terms in the arithmetic sequence of integers 
between b + 1 and b + m that have d as a divisor. 
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3 A recurrent expression for <p(b, m, k) 

We now prove a recurrent expression for ip(b,m,k), the number of integers from b + 1 to 
b + m which are coprime to Pk- 

Theorem 3.1. For integers m, b and k we have 



ip(b,m,k) = m-^2F b)m (pi) + Y (u k (a) - 1) 
i=i 

Proof. The expression 



b<a<b+m 
uj k (a)>0 



m -^2Fb, m (Pi) 



undercounts ip(b,m, k) by Wfc(a) — 1 for each integer a in our sequence that has u>k(a) > 0, 
and so 

k 

ip(b,m,k) = m-^2, F b, m iPi) + Y ( w fc(a) - 1) 

i=l b<a<b+m 
uj k (a)>0 

as required. □ 
Theorem 3.2. For any p x | let S be the set of integers a G B that have 4(a) = p x . Then 

J^(a;fc(a) - 1) = F s(PiPx) 

aeS x<i<k 

Proof. Assume some p x | P^. For any a G S the total number of composites p x pi dividing a 
where Pi \ Pk is therefore equal to the number of prime factors p,i ^ p x of Pk which divide a. 
Since p x \ a this total is equal to Uk(a) — 1. Each composite pip x thus contributes 1 to the 
sum 

^K(a) - 1) 

for each a G S 1 which has as a divisor and so the total contribution that each such 
composite makes to that sum is F s (pip x ), and the result follows. □ 

Theorem 3.3. 

k k-i k 

Y M a )- 1 ) = Y F b,™{2Pj) + Y Y ^( C b(PiPj)> F b,m{PiPj)^ ~ X ) 
b<a<b+m j=2 i=2 j=i+l 

uj k (a)>0 
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Proof. The proof is inductive. For the base of the induction, we note from Theorem 3.2 that 

k 

1 >'■>:■,. 



Y {^k{a) - 1) = ^ F b)m (2pj 



b<a<b+m j=2 
Lu 1 (a)>0 

To prove induction we begin by assuming that 

k x k 

Y ~ X ) = Y F t>,m(2Pj) + Y Y V( C b(PiPj)> F b,m(PiPj)^ ~ 1) 

b<a<b+m j=2 i=2 j=i+l 

u x (a)>0 

holds for some x < k — 1. 

Let 5 be the set of integers in our sequence b + 1, . . . , b + m which are coprime to P x and 
which have p x +\ as a factor. Since all integers not in S have either already been counted or 
have co x+ i(a) = 0, we have 

E ("*(a)-l)= E Ma)-l) + Y,(Ma)-V 

b<a<b+m b<a<b+m a£S 

uj x+ i(a)>0 u) x (a)>0 



By definition ifc(a) = p x +\ for all a 6 S, and so from Theorem 3.2 we have 

E(^fc( a ) - 1) = Y F s(PjPx+i) 

aeS x+l<j<k 

We can rewrite the right hand side here as 

k 

Y F S {pip x+1 )= y Y 1 

x+l<i<k j=x+2 b<a<b+m 

Pw+lPj\a 
a coprime to P x 

For each pair p x +iPj we have an arithmetic subsequence consisting of those integers be- 
tween 6+1 and b + m that have p x +iPj as a factor. This subsequence contains F 0tm (PjPx+i) 
integers. The right hand side in the above expression contributes 1 for each member of this 
arithmetic subsequence which is coprime to P x . From Theorem 2.1 this arithmetic subse- 
quence is equivalent to a sequence of F biTn (j)jp x+1 ) consecutive integers starting at c b (p x+1 pj), 
and so the right hand side above contributes 1 for each member of this sequence which is 
coprime to P x , giving a total contribution of 

(f(c x+1:j} F bjm (pjp x+1 ),x) 

for each such pair. We thus have 

k 

Y( UJ k(a)-l)= Y f( c ^+hj^ F b,m(PjPx+i),x) 

aeS j=x+2 



and so 



k x+1 k 

^2 (wjfc(a) - 1) = ^2 F b,m(2Pj) + S vicbiPiPj), F b , m (piPj), i-l) 

j=2 i=2 j=x+2 



b<a<b+m 
u) x+1 (a)>0 



also holds for x + 1. This completes the induction and gives the required result. 



□ 



Finally, combining Theorems 3^ and 3^3 we have 
Theorem 3.4. 

k k k—1 k 

ip(b, m,k) = m-) j F bim (pi) + F b>m (2pj) + ^ Lp(c b (piPj), F bim {piPj),% - 1) 

i=2 t=2 i=i+l 



i=l 



/or a// 6, m and fc. 



4 A lower bound on (f m i n (m : k) 

We now give a lower bound on (p min (m, k), the minimum value of <p(b,m,k) across all b . 
This bound makes use of constraints on the co-occurence of residues of primes to We 
begin with a very obvious result, which we give without proof. 

Theorem 4.1. If d\m then 



F b , m {d) 



m 



d 



(6 + m) mod d <m mod d 

u 

Using this we get 

Theorem 4.2. For primes p and q let x = m mod p. If x > and q \ m — x + p then 



m 
P 



F b , m (pq) 



F b , m (p) 

Proof. Let x = m mod p. Assume x > 0, q\m — x + p and 

[m 

F b , m {p) = — 
P 



m 
pq 



Then since x > means p \ m from Theorem 4.1 we have 

(b + m) mod p < x 

Let 

w = 6 + m — ((6 + m) mod p) 



6 



and p divides every integer in the arithmetic sequence y,y — p, . . . ,y — (q — l)p. Since there 
are q terms in this sequence, one of these terms is also divided by q, and so 



However 
and so 



{b + m) mod pq < b + m — y — (q — l)p 

b + m — y = (b + in) mod p < x 
(b + in) mod pq < x + (q — l)p 



By assumption we have q \ in — x + p; by definition p | in — x and so we have pq \ in — x + p 
and x < p < pq. We thus get m mod pq = x — p + pq = x + (q — l)p and so 



(b + m) mod pq < m mod pq 



and from Theorem 4.1 we get the required result. 



□ 



Theorem 4.3. For any integer m there exists an integer b such that tp(b, m, k) = (p min (m, k) 
and such that for all odd primes p \ m — 1 we have 



Fb,m{p) 



m 
P 



F b ,m(2p) 



Til 
2p 



Proof. Let b be some integer such that <p(b, m, k) = (p m i n (m, k) and 2 | b+m. (To see that we 
will always be able to select such a b, note that for any a such that <£>(a, m, k) = (p m i n (m, k) 
we either have 2\a + mor2\a + m. If 2 | a + m then b = a satisfies our requirements, 
whereas if 2 \ a + m then 2 | a + m + 1 and so (p(a + 1, m, k) = (p(a, m, k) = {p min (m, k) and 
b = a + 1 satisfies our requirements.) For any p | m — 1 we have 



Fb, m (p) 



m 
P 



{b + m) mod p < m mod p 



(from Theorem 4.1 ) and so for any p \ m — 1 we have 
Fb,m(p) 



m 
P 



p\ (b + m) =^ 2p | (b + m) ^ F h ^ m (2p) 



m 
2p 



as required. 

Combining these results we get the following 
Theorem 4.4. For integer m let r = m mod and let 



□ 



E 



{i : 1 < % < k,Pi \ r, 2 | (r — (r mod p^ + Pi) or pi | r — 1} 
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Then 



m 



i=l 



Pi 



i=2 
fe-1 k 



r 

2p~i. 



+ E 



i=2 j=i+l 



PiPj 



for all m and k. 

Proof. Assume r and E as denned above. Let b be an integer for which the conditions in 
Theorem 4.3 hold. Since Euler's totient (p(n) gives the number of integers coprime to n in 
any sequence of n consecutive integers we have 



ip{b,m, k) = <p(P k ) 



rn 
P~ k 



+ (p{b,r, k) 



and we need only consider the value of tp(b, r, k). From Theorems 4.2 and 4.3 we see that 



FbAp) 



and F(, r (2p) 



r 

2p 



cannot hold for any prime counted in the definition of E, and so we have 



k pi fc 
^ Pi + ^ 

1=1 ^ % i=2 



r 

2p~t 



k k 

+ E<r-J2 F b,r(Pi) + F hA 2 Pi 

3=2 



1=1 



Combining this with the fact that by definition 

* - 1 ) < <P{Cb{PiPj), Eb,m{PiPj),i - I) 



m 



.PiPj. 



we get 



m 
Pk 



+ r-V — +V 



i=l 



Pi 



r 

2p~i 



PiPj. 



fe-i k 

+ e + 22 ^2 Lpmin 

i=2 l-j-' J i=2 j=i+\ 

k k k— 1 k 

- m ~ F >>,m(Pi) + ^2 F b,mi^Pj) + ^(^(PiPj), F b,m(PiPj), » ~ 1) 

i=l j=2 i=2 j=i+l 



From Theorem 3.4 the right hand side of this expression is equal to cp(b, m, k), which from 
Theorem 4.3 is equal to ip m i n ('m, k) and we get the required result. □ 
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12: 



16: 



20: 



Algorithm 1 The function (pi ow (m, k). This function requires that explicit values of 
{p min (m,k) are known for all k up to 6 and all m < P k . Known values of h(k) for k 
less than 50 are used only if the variable UseKnown is set. 
function (^/ ou ,(m,k) 

L <r- lf{P k )S 

4: r <r- m — S 

if k < 6 then return L + v (r, k) end if 
if r < 2pfc_i then return L end if 

if [/ seKnown = True and A; < 49 and r < then return L end if 

8: L^L + r-\l\ 



Pi 



+ 



2pi 



end for 



for i = 2 to A; do I f- L 
E^O 

for i = 2 to k do 

if ^ f r and (2 | (r — (r mod p^) + pi) or | (r — 1)) then 
E <- £ + 1 

end if 
end for 
i <- 2 
repeat 

j = i + 1 
repeat 



PiPj 
(m r 



if U > then L + U end if 

J <- J + 1 
24: until j = k or U < 

i <- i + 1 
until i — k 

if L > then return L else return end if 
28: end function 
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5 Computations 



We first describe Algorithm 1, which computes a recursive function ipi ow (m, k). Values of 



this function give a lower bound on (p m i n (m, k) as given in Theorem 4.4 



In this algorithm the variables L is used to hold the incrementally computed lower bound 



on f m in( m : k) . Lines 2 to 4 obtain a value for 



in 

Pk 



assign an initial value for L as in Theorem 



4.4, and obtain a value for r (again as in Theorem 4.4). 



We explicitly computed the value of ip min (m,i) for all i less than or equal to 6 and for 
all m less than P^, these provide stopping conditions for the recursion in (pi ow {m, k) (line 5). 

From Hagedorn p] we have explicit values of h(k) for k < 49, and we also have the 
general result that 2p^-i < h(k) for all k. Since fmin{iTi, k) = if m < h{k) we use both 
the explicit values and the general result as further stopping conditions for the recursion 
(lines 6 and 7). Hagedorn's explicit values for h(k) for k < 49 are used only if the variable 
UseKnown is set to true. 



Lines 8 and 9 calculate the first three terms in the left hand side of Theorem |4.4[ with 
lines 9 to 14 calculating the E term in that sum. 

Lines 16 to 26 give two nested loops containing recursive calls to (pi ow (-) and so calculating 
a lower bound on value of the double sum in Theorem 4.4 For each pair of values for indices 
i and j in these loops, the algorithm recursively gets a lower bound on i/3 m j n (m nOT , i — 1) as 
in Theorem 4A, placing this bound in a variable U. 

Finally, line 27 returns a lower bound on ip m i n (m, k). Since v?mm(^, k) cannot be less than 
(no sequence contains a negative number of primes) the algorithm returns the computed 
bound L only if L > 0; otherwise is returned. 

At lines 24 and 26 the inner and outer loops terminate at j = k and % — k — 1 respectively, 



as in the double sum in Theorem |4.4[ For efficiency the inner loop also terminates if the 
last obtained recursive lower bound value U was equal to 0: in this case all further recursive 
calls within this loop will also return a value of 0, b decreasing on each cycle 

of the inner loop. Similarly, the outer loop terminates if the first recursive bound value U 
obtained on the previous cycle of the inner loop had a value of 0: in this case recursive calls 
in all further cycles of the inner loop will also return a value of 0, because all values of m new 
in those further cycles will be less than the current value of m new . 

We implemented the function Lfi ow in the computer algebra system PARI |10] . We take 
b(k) to represent the lowest integer m such that ipi ow (m,k) > and so b{k) is an upper 
bound on h(k). We find b(k) using linear search across increasing values of m (Algorithm 

2). 

To compare values of b(k) with the values of h(k) calculated by Hagedorn for k from 1 to 
49 [2] we ran Algorithm 2 with start = 1, end = 49, initialM = 1 and variable UseKnown 
set to False. In this range the bound b(k) was less than 3 times the true value of h(k). 
Figure 1 graphs h(k), b(k), and Steven's and Kanold's bounds on h(k) in this range. 

To calculate values of b(k) up to k = 1000 we ran Algorithm 2 with start = 50, end = 
1000, variable UseKnown set to True, and initialM set to h(A9) = 742. 

For a given k Algorithm 2 calculates values of <fi ow (m,k) for increasing values of m 
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Algorithm 2 Calculating b(k) for k from start to end. 
m i— initialM 
for fc = start to end do 
while (pi ow (m,k) < 1 do 

m m + 1 
end while 
print (k,m) 
end for 



starting atm = b{k — 1) + 1 and continuing until is reached (that is, until ipi ow (m, k) > 0). 
Clearly the number of integers m for which Algorithm 2 must calculate (pi ow (m, k) grows as 
k grows. Algorithm 2 will thus be relatively slow for large k (in our runs Algorithm 2 took 
around 15 minutes to calculate 6(1000)). For this reason when calculating bounds b(k) for k 
greater than 1000 we took an alternate approach, using the function ipi ow to find the lowest 
integer Ck such that 

C k k 2 



Vic 



10000 



k > 



(see Algorithm 3). For each Ck we then have h(k) < (Ck A; 2 )/10000. To calculate values 
of b(k) up to k = 10000 we ran Algorithm 3 with start = 1001, end = 10000, variable 
UseKnown set to True, and initialC set to 10000. 



10000 



k) > do 



Algorithm 3 Calculating an upper bound on b(k) for k from start to end. 
Ck ^— initialC 
for k = start to end do 
S^C k 
while <p low ( 

Ck ^— Ck — 1 
end while 
if C k < S then 
Ck <— Ck + 1 
else 

while ip iow ( 

Cfc Ck + 1 
end while 
end if 

' c k k 2 



C k k 2 
10000 



k < 1 do 



print ^/c, 
end for 



10000 
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Figure 1 graphs b(k) for k from 50 to 10000 as obtained from these algorithms. We find 



b{k) < 0.27749612254 k 2 log k 

for all k in this range, and so this gives an upper bound on h(k) for these k. 

Figure 2 compares h{k), b{k), Kanold's bound and Steven's bound for k up to 49. Figure 
3 compares the logs of b(k), Stevens' bound, and Kanold's bounds for k up to 10000. From 
this graph we see that b{k) is hundreds of orders of magnitude stronger than Stevens' and 
Kanold's bounds in this range. 
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Figure 1: Graph of b(k) vs k for k from 50 to 10000 
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k 

Figure 3: Graph comparing the log of b(k) with the logs of Kanold's and Steven's bounds 
for k to 10000. 
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